Performance Tournaments with Crowdsourced Judges

Authors

  • Daryl Pregibon
  • William D Heavlin
Abstract

A performance slam is a competition among a fixed set of performances in which pairs of performances are judged by audience participants. When performances are recorded on electronic media, performance slams become amenable to audiences that watch online and judge asynchronously (crowdsourced). To better entertain the audience, we want to show the better performances (exploitation). To identify the good videos, we want to glean at least some information about all videos (exploration). Our approach has three elements: (1) We take our preference model from Bradley and Terry (1952). (2) We calculate its parameters by rewriting the likelihood gradient into a fixed-point estimate, one which mimics the estimate of Mantel and Haenszel (1959). (3) Each pair of performances is chosen sequentially, always to minimize the weighted variance of (the logarithms of) the Bradley-Terry parameter estimates. Our preferred weights consist of the logrank weights proposed by Savage (1956).
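
To make elements (1) and (2) concrete, here is a minimal sketch of fitting Bradley-Terry strengths by a fixed-point iteration. It uses the standard minorization-maximization (Zermelo-style) update, which is one well-known way to turn the likelihood gradient into a fixed-point equation; it is not necessarily the Mantel-Haenszel-like estimate the authors derive, and it does not cover the sequential pair selection of element (3). The function names `fit_bradley_terry` and `win_probability` and the `wins` matrix layout are assumptions made for illustration.

```python
def fit_bradley_terry(wins, max_iter=500, tol=1e-12):
    """Estimate Bradley-Terry strengths from a pairwise win-count matrix.

    wins[i][j] is the number of times performance i was preferred to j.
    Assumes every performance has at least one win and one loss, so the
    estimates stay strictly positive and finite.
    """
    n = len(wins)
    p = [1.0] * n  # start from equal strengths
    for _ in range(max_iter):
        new_p = []
        for i in range(n):
            total_wins = sum(wins[i][j] for j in range(n) if j != i)
            # Sum over opponents j of (comparisons between i and j) / (p_i + p_j)
            denom = sum(
                (wins[i][j] + wins[j][i]) / (p[i] + p[j])
                for j in range(n)
                if j != i
            )
            new_p.append(total_wins / denom if denom > 0 else p[i])
        # Strengths are identified only up to scale; normalize to sum to n.
        scale = n / sum(new_p)
        new_p = [x * scale for x in new_p]
        converged = max(abs(a - b) for a, b in zip(new_p, p)) < tol
        p = new_p
        if converged:
            break
    return p


def win_probability(p, i, j):
    """Bradley-Terry probability that performance i is preferred to j."""
    return p[i] / (p[i] + p[j])
```

For two performances where the first is preferred three times out of four, the fitted strengths have ratio 3 to 1, so `win_probability` returns 0.75 for the first over the second.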

Related resources

IRT-based Aggregation Model of Crowdsourced Pairwise Comparison for Evaluating Machine Translations

Recent work on machine translation has used crowdsourcing to reduce the cost of manual evaluations. However, crowdsourced judgments are often biased and inaccurate. In this paper, we present a statistical model that aggregates many manual pairwise comparisons to robustly measure a machine translation system’s performance. Our method applies the graded response model from item response theory (IRT), wh...

Human versus Machine Attention in Document Classification: A Dataset with Crowdsourced Annotations

We present a dataset in which the contribution of each sentence of a review to the review-level rating is quantified by human judges. We define an annotation task and crowdsource it for 100 audiobook reviews with 1,662 sentences and 3 aspects: story, performance, and overall quality. The dataset is suitable for intrinsic evaluation of explicit document models with attention mechanisms, for multi...

Evidence of nationalistic bias in muaythai.

MuayThai is a combat sport with a growing international profile, but limited research has been conducted into its judging practices and processes. Problems with the judging of other subjectively judged combat sports have caused controversy at major international tournaments and have resulted in changes to scoring methods. Nationalistic bias has been central to these problems and has been identified across a ran...

A Human-Centered Framework for Ensuring Reliability on Crowdsourced Labeling Tasks

This paper describes an approach to improving the reliability of a crowdsourced labeling task for which there is no objective right answer. Our approach focuses on three contingent elements of the labeling task: data quality, worker reliability, and task design. We describe how we developed and applied this framework to the task of labeling tweets according to their interestingness. We use in-t...

Hollywood Comes East to Take Your Portrait

The participants from State include: George Christy, Thomas Godard, Harold Vaughn, Seniors; Walter Parmer '51. Representing State in the Congressional Discussion will be George Christy, Thomas Goddard and Harold Vaughn, Seniors, and Paul LeBrun, Walter Farmer and Edwin Kurlander, Juniors. Miss Elnora M. Drafahl, Instructor in English, will act as one of the judges. Two members of the group, ...

Publication date: 2013